Basic Statistics

Raw Counts

Name Value
Rows 50,000
Columns 28
Discrete columns 9
Continuous columns 19
All missing columns 0
Missing observations 2,460
Complete Rows 47,734
Total observations 1,400,000
Memory allocation 11.9 Mb

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 7 columns ignored with more than 50 categories.
## brewery_city: 2024 categories
## brewery_state: 65 categories
## brewery_country: 112 categories
## brewery_name: 2856 categories
## review_profilename: 8905 categories
## beer_style: 104 categories
## beer_name: 12079 categories

QQ Plot

## Warning: Removed 46 rows containing non-finite values
## (stat_qq).
## Warning: Removed 46 rows containing non-finite values
## (stat_qq_line).

## Warning: Removed 6 rows containing non-finite values (stat_qq).
## Warning: Removed 6 rows containing non-finite values
## (stat_qq_line).

Correlation Analysis

## 8 features with more than 20 categories ignored!
## brewery_city: 1861 categories
## brewery_state: 65 categories
## brewery_country: 111 categories
## brewery_name: 2562 categories
## review_profilename: 8784 categories
## beer_style: 104 categories
## beer_name: 10578 categories
## beer_category: 47 categories

Principal Component Analysis

## 7 features with more than 50 categories ignored!
## brewery_city: 1861 categories
## brewery_state: 65 categories
## brewery_country: 111 categories
## brewery_name: 2562 categories
## review_profilename: 8784 categories
## beer_style: 104 categories
## beer_name: 10578 categories